Corpus: fin_news_2012_100K

Other corpora

4.4.1.5 Number of Word-N-grams at Sentence Endings

Number of word-N-grams for N=1...5 for the first K sentences

K # of words # of bigrams # of trigrams # of 4-grams # of 5-grams
100 97 99 99 99 99
1000 923 994 999 999 999
10000 7153 9644 9950 9989 9995
100000 44989 88603 97600 99440 99796
1000000 44990 88604 97601 99441 99797


Zipf's diagram for sentence endings


Gnuplot diagram

12566 msec needed at 2021-06-30 18:22